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Abstract 

Background: Studies adopting electronic medical records and genomic information are becoming widespread. 
Through this new modality in research, it is possible to study how genetic variants influence susceptibility towards 
chronic conditions and can improve patient care. 

Our aim is to develop a biobank with 2,000 heart failure patients treated in a tertiary cardiology hospital containing 
electronic medical records data and biologic samples for performing genome-wide association studies for validation 
and development of medical decision routines aimed at helping the clinical management of patients. 

Methods/Design: Patients between 18 and 80 years old with heart failure diagnosis of different etiologies and left 
ventricular ejection fraction < 50% in the past 2 years will be eligible for enrollment on the cohort. After consent, 
patients will be submitted to clinical baseline, echocardiography, cardiograph impedance and biochemical 
evaluation. Study data will be collected and managed using Research Electronic Data Capture tools. The follow up 
will take place every 6 months to assess cardiovascular outcomes (all-cause mortality, cardiovascular mortality, 
hospitalization for worsening heart failure and current medication use). Initial analytical strategy will focus on the 
establishment of the accuracy of electronic medical records extraction protocols for main predictor factors of 
morbidity and mortality in heart failure. 

Discussion: Building a biobank with biologic samples and clinical data of 2,000 heart failure patients we will 
perform genome-wide association studies. By this way, we pretend to study how genetic variants influence 
susceptibility towards chronic conditions. Besides, it will be created a working group focused on the development 
and implementation of algorithms for validation and application of medical routines using the electronic medical 
records of the Heart Institute (InCor - HCFMUSP). 

Trial registration: Current Controlled Trials NTC02043431. 
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Background 

Heart failure is an important cause of morbidity and 
mortality worldwide and in Latin America, the risk factors 
and epidemiology have peculiarities that are still unclear. 
Recently, Bocchi et cols discussed the similarities in risk 
factors compared to developed countries highlighting the 
prevalence of others causes like Chagas'disease, rheumatic 
fever and lower total expenditure on health per capita [1]. 

The mapping of the human genome has enabled new 
exploration on how genetic variation contributes to health 
and disease [2]. Technological advances in the under- 
standing of disease processes are presented as new oppor- 
tunities to improve healthcare through management of 
individual patients. In this context, the adoption of elec- 
tronic medical records (EMR), used by clinicians to docu- 
ment clinical care, is becoming widespread and recent 
studies demonstrate that they can be effectively employed 
for genetic studies using the informational and biological 
'by-products' of healthcare delivery while maintaining pa- 
tient privacy [3]. 

This new approach in research leads on the Personalized 
Medicine Research Project, a population-based biobank 
that involves discussions about community consultation 
and other legal aspects [4]. Actually, efforts to improve pa- 
tients' understanding of their own medical treatments or 
research in which they are involved are progressing, espe- 
cially with regard to informed consent procedures [5]. 

Recently, the National Human Genome Research In- 
stitute (NHGRI) called on researchers to determine how 
genetic variants influence susceptibility towards chronic 
conditions such as diabetes, Alzheimer's disease, and car- 
diovascular disease, in order to ultimately improve patient 
care. For this purpose, the eMERGE (electronic MEdical 
Records and GEnomics) Network - an NHGRI-supported 
consortium of five institutions - was created to explore 
the utility of DNA repositories coupled to EMR systems 
for advancing discovery in genome science [6]. 

Allied to this new modality in research, the adoption 
of an online database allows easy access and processing 
of information collected in a safe and efficient way. The 
Research electronic data capture (REDCap) is a novel 
workflow methodology and software solution designed 
for rapid development and deployment of electronic data 
capture tools to support clinical and translational research. 
It was initially developed and deployed at Vanderbilt Uni- 
versity, but now has collaborative support from a wide 
consortium of domestic and international partners [7]. 

It is clear that new strategies are essential for preven- 
tion and treatment of chronic diseases like heart failure. 
Conditions described in Latin America that has peculiar 
epidemiology compared to other countries, need further 
studies. In this scenario, our aim is to develop a biobank 
containing samples of 2,000 patients with heart failure diag- 
nosis treated in a tertiary cardiology hospital containing 



electronic medical record and genetic data in genome-wide 
scale for performing genetic association studies for valid- 
ation and development of medical decision routines that 
help the clinical management of these patients. 

Methods/Design 

Research design 

Subjects with systolic heart failure, sequentially included, 
will be submitted to clinical baseline, echocardiography, 
cardiograph impedance and biochemical evaluation, which 
include blood, urine and exhaled breath acetone samples. 
It will allow the authors to best define the heart failure 
syndrome, its etiology and its hemodynamic profile. All 
patients included will also be submitted to clinical 
follow-up every 6-month for 3 years to assess cardio- 
vascular outcomes: all-cause mortality, cardiovascular 
mortality, hospitalization for worsening heart failure, 
current medication use and heart transplantation. The 
diagnosis of heart failure will be made according to pre- 
viously published criteria [8]. To define the appropriate 
etiology of cardiomyopathy, the authors will follow pre- 
vious guidelines [9,10]. 

Patients selection 

The individuals invited to our study will be consecutively 
selected from the Heart Institute - Clinical Hospital- Uni- 
versity of Sao Paulo Medical School (InCor - HCFMUSP) 
on their outpatient admission, starting at January 2011. 

Ethics 

The study protocol was approved by the Ethics Commit- 
tee for Medical Research on Human Beings of the Clinical 
Hospital from University of Sao Paulo Medical School 
(SDC 2368/03/162). Signed informed consent will be ob- 
tained from all participants. 

Inclusion criteria 

Patients between 18 and 80 years old with heart failure 
diagnosis of different etiologies and left ventricular ejec- 
tion fraction < 50% on two-dimensional transthoracic 
Doppler echocardiography performed in the past 2 years, 
who are admitted to the Heart Institute from January 
2011, will be eligible for enrollment on the cohort. Spe- 
cifically, patients with cardiomyopathy due to valvular 
heart disease enrolled in the study will be those with se- 
vere left ventricular dysfunction to the point that they 
would not be eligible for valve repair or replacement, 
but rather candidates for heart transplantation. 

Exclusion criteria 

Patients with impaired cognition due to advanced de- 
mentia syndrome or severe psychiatric disorder, without 
telephone access or that refuse to participate will be 
excluded. 
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Left ventricular function assessment 

During echocardiography, transthoracic images will be 
obtained with the patient in the left lateral position using 
Acuson Sequoia 512 (Siemens). The image acquisitions 
will be standardized in the four windows: parasternal 
long and short axis, and apical 4 - and 2-chambers. The 
evaluation of the left ventricular (LV) function will be 
performed by the echocardiography staff in a blinded 
way in relation to the protocol and will be conducted ac- 
cording to previously standard recommendations. Left 
ventricular systolic and diastolic volumes and ejection 
fraction will be derived from biplane apical views using 
the modified Simpson's rule algorithm. Diastolic dys- 
function will be determined using Doppler measure- 
ments of mitral inflow and Doppler tissue imaging (DTI) 
of the mitral annulus. 

Biobanking 

All enrolled patients will have fasting serum, exhaled 
breath acetone, plasma and urine samples collected and 
stored at -80°C for further analysis. 

Genomic DNA protocol 

Genomic DNA will be extracted from peripheral leuco- 
cytes after a standard salting out protocol [11]. DNA 
quality and integrity will be checked in all samples after 
extraction protocol. 

Whole-blood RNA collection 

The enrolled patients will have whole-blood RNA col- 
lected in fasting state and stored in PAXGene tubes for 
further analysis according to standard protocol [12]. 

Exhaled breath acetone (EBA) 

The exhaled breath acetone has been described as a good 
biomarker of the diagnosis of heart failure as previously 
described [13] and will be used associated to BNP (B-type 
natriuretic peptide) and clinical criteria. After collection of 
blood samples, patients will have standard diet and will be 
submitted to the exhaled breath acetone collection by 
using a simple and portable device. After collection, the 
liquid phase of the sample will be transferred from the 
device to a capped vial and stored in a freezer at -80°C 
until acetone analysis. 

Studied variables 

Enrolled patients will have the opportunity to contribute 
to genetic associations regarding a number of demo- 
graphic, clinical and follow-up information. All included 
individuals will have data on sex, ethnicity, age, duration 
of symptoms, etiology of heart failure, history of diabetes 
mellitus, arterial hypertension, smoking, drugs and mor- 
bidity outcomes. Anthropometric data like body mass 
index, heart rate, blood pressure, cardiac rhythm, cardiac 



dimensions on echocardiography and left ventricle ejec- 
tion fraction (both baseline and sequential in one year 
intervals). We will perform serum analyses (sodium, po- 
tassium, total cholesterol, triglycerides, HDL-cholesterol, 
LDL-cholesterol, VLDL-cholesterol, glucose, creatinine, 
urea, B-type natriuretic peptide), exhaled breath acetone 
dosage and urine analyze. Follow-up will be assessed 
every 6 months to investigate clinical outcomes as cited 
above. In addition, the mortality database of Sao Paulo 
City Authority will also be periodically scrutinized to 
discover patient deaths (ProAim-Programa de Aprimor- 
amento de Informacoes de Mortalidade do Municipio de 
Sao Paulo). Follow-up will be carried for all individuals 
until a 3-year period. 

Electronic data collection (REDCap) 

Study data will be collected and managed using REDCap 
electronic data capture tools hosted at the Clinical Hos- 
pital from University of Sao Paulo Medical School. RED- 
Cap (Research Electronic Data Capture) is a secure, web- 
based application designed to support data capture for 
research studies, providing: 1) an intuitive interface for 
validated data entry; 2) audit trails for tracking data ma- 
nipulation and export procedures; 3) automated export 
procedures for seamless data downloads to common stat- 
istical packages; and 4) procedures for importing data 
from external sources [7]. 

Cardiograph impedance 

Hemodynamic data will be obtained noninvasively using 
a monitor CardioScreen 2000 (Medis, Germany) approved 
for marketing by U.S. Food and Drug Administration 
(FDA). This device measures the thoracic bioimpedance 
using the following technique: four dual sensors (each sen- 
sor consists of two electrodes) are placed on the patient, 
on opposite sides of the neck, at a level between the ears 
and shoulders and on both sides of the chest in the 
mid-axillary line at the level of the xiphoid process. The 
electrodes transmit a current of low amplitude and high 
frequency (2.5 mA and 70 kHz), and detect changes in 
voltage chest. These alterations in voltage are used to 
calculate the impedance (Z). The following parameters 
will be assessed: Cardiac index (CI), Stroke Volume 
Indexed (SVI), Mean Arterial Pressure (MAP), systemic 
vascular resistance index (SVRI), Thoracic Fluid Con- 
tent Indexed (CLTI), velocity index, acceleration index, 
pre-ejection period, LV ejection time, and systolic time 
ratio (inversely proportional to LV function; pre-ejection 
period divided by LV ejection time). 

Genome-wide association studies (GWAS) 

The genomic DNA samples will be processed and geno- 
typed in the Affymetrix biobank platform for around 
800.000 polymorphic markers. For quality control, will 



Gioli-Pereira et al. BMC Cardiovascular Disorders 2014, 14:32 
http://www.biomedcentral.eom/1 471 -2261 /1 4/32 



Page 4 of 5 



be deleted SNPs with genotypes not identified in > 20% 
of the samples, SNPs with allele frequency rarer (MAF) 
<1%, or results to Hardy- Weinberg test <0.001. Further, 
after exclusion of SNPs with quality problems in geno- 
typing, individuals will be analyzed for consistency of 
genotyping. Individuals with genotyping rate <90% will 
be excluded. 

Development of diagnostic algorithms based on 
electronic medical records 

The Heart Institute (InCor - HCFMUSP) had developed 
and used an in-house system of electronic medical re- 
cords in the last 2 decades. This information system has 
medical data that need to be validated for use in system- 
atic clinical research. Using these medical records we 
pretend to create algorithms for data extraction that en- 
able selection and classification of individuals, or the 
extraction of continuous variables of medical interest 
that can be used in research protocols or development 
of medical decision tools. In summary, we intent to 
evaluate the validity and specificity of this data collection 
system and apply it in genetic association studies as well 
as medical routines of the Institute. 

Statistical analysis 

Data will be analyzed using SPSS for Windows version 
13.0 (SPSS Inc) statistical software. 

Discussion 

We expect to develop a biobank containing samples and 
clinical data of 2,000 patients with heart failure diagnosis 
for performing genetic association studies. Besides, we 
aim to create a working group focused on the develop- 
ment and implementation of algorithms for validation 
and application of medical decision routines using our 
electronic medical record system in order to help the 
clinical management of patients at the Heart Institute 
(InCor - HCFMUSP). Furthermore, the project will focus 
on ethical, legal, political and social impact of this new 
reality, such as privacy, confidentiality and possible in- 
teractions with the community. In this sense, we intend 
to identify sources of concern among participants re- 
garding current approaches to obtaining informed con- 
sent for storage and use of genetic material in clinical 
research. 

In this way, with the appropriate use of genetic data in 
genome-wide scale associated to electronic medical in- 
formation, we expect to accelerate both the discovery of 
genetic factors associated with complex diseases such as 
heart failure and create medical decision routines that 
help the clinical management of patients in a tertiary 
cardiology hospital. 
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